Open-Source Portuguese-Spanish Machine Translation
نویسندگان
چکیده
This paper describes the current status of development of an open-source shallow-transfer machine translation (MT) system for the [European] Portuguese ↔ Spanish language pair, developed using the OpenTrad Apertium MT toolbox (www.apertium.org). Apertium uses finite-state transducers for lexical processing, hidden Markov models for part-of-speech tagging, and finite-state-based chunking for structural transfer, and is based on a simple rationale: to produce fast, reasonably intelligible and easily correctable translations between related languages, it suffices to use a MT strategy which uses shallow parsing techniques to refine word-for-word MT. This paper briefly describes the MT engine, the formats it uses for linguistic data, and the compilers that convert these data into an efficient format used by the engine, and then goes on to describe in more detail the pilot Portuguese↔Spanish linguistic data.
منابع مشابه
An open-source shallow-transfer machine translation toolbox: consequences of its release and availability
By the time Machine Translation Summit X is held in September 2005, our group will have released an open-source machine translation toolbox as part of a large government-funded project involving four universities and three linguistic technology companies from Spain. The machine translation toolbox, which will most likely be released under a GPL-like license includes (a) the open-source engine i...
متن کاملAn Open-Source Shallow-Transfer Machine Translation Engine for the Romance Languages of Spain
We present the current status of development of an open-source shallow-transfer machine translation engine for the Romance languages of Spain (the main ones being Spanish, Catalan and Galician) as part of a larger government-funded project which includes non-Romance languages such as Basque and involving both universities and linguistic technology companies. The machine translation architecture...
متن کاملLCC's PowerAnswer at QA@CLEF 2006
This paper reports on Language Computer Corporation’s first QA@CLEF participation. For this exercise, we integrated our open-domain PowerAnswer question answering system with our statistical machine translation engine. For 2006, we participated in the English-to-Spanish, French and Portuguese cross-language tasks. We took the approach of intermediate translation, only processing English within ...
متن کاملKeynote Lecture 1: Practical Use of Machine Translation in International Organizations
WIPO has developed its own MT tool, initially based on open-source Moses (phrase-based SMT PBSMT), more recently based on open-source for Neural Machine Translation (NMT), namely Nematus and AmunMT. The PBSMT tool, trained on different documents, has been successfully installed in the UN (United Nations) and in other international organizations (ITU, IMO, FAO, WTO, ILO, WTO, TGF and FAO). The t...
متن کاملDevelopment of an open-source web-based intervention for Brazilian smokers – Viva sem Tabaco
BACKGROUND Web-based interventions for smoking cessation available in Portuguese do not adhere to evidence-based treatment guidelines. Besides, all existing web-based interventions are built on proprietary platforms that developing countries often cannot afford. We aimed to describe the development of "Viva sem Tabaco", an open-source web-based intervention. RESULTS The development of the int...
متن کامل